15 research outputs found

    Evaluating openEHR for storing computable representations of electronic health record phenotyping algorithms

    Get PDF
    Electronic Health Records (EHR) are data generated during routine clinical care. EHR offer researchers unprecedented phenotypic breadth and depth and have the potential to accelerate the pace of precision medicine at scale. A main EHR use-case is creating phenotyping algorithms to define disease status, onset and severity. Currently, no common machine-readable standard exists for defining phenotyping algorithms which often are stored in human-readable formats. As a result, the translation of algorithms to implementation code is challenging and sharing across the scientific community is problematic. In this paper, we evaluate openEHR, a formal EHR data specification, for computable representations of EHR phenotyping algorithms.Comment: 30th IEEE International Symposium on Computer-Based Medical Systems - IEEE CBMS 201

    Evaluation of Semantic Web Technologies for Storing Computable Definitions of Electronic Health Records Phenotyping Algorithms

    Get PDF
    Electronic Health Records are electronic data generated during or as a byproduct of routine patient care. Structured, semi-structured and unstructured EHR offer researchers unprecedented phenotypic breadth and depth and have the potential to accelerate the development of precision medicine approaches at scale. A main EHR use-case is defining phenotyping algorithms that identify disease status, onset and severity. Phenotyping algorithms utilize diagnoses, prescriptions, laboratory tests, symptoms and other elements in order to identify patients with or without a specific trait. No common standardized, structured, computable format exists for storing phenotyping algorithms. The majority of algorithms are stored as human-readable descriptive text documents making their translation to code challenging due to their inherent complexity and hinders their sharing and re-use across the community. In this paper, we evaluate the two key Semantic Web Technologies, the Web Ontology Language and the Resource Description Framework, for enabling computable representations of EHR-driven phenotyping algorithms.Comment: Accepted American Medical Informatics Association Annual Symposium 201

    Comparing clinical trial population representativeness to real-world populations: an external validity analysis encompassing 43 895 trials and 5 685 738 individuals across 989 unique drugs and 286 conditions in England

    Get PDF
    BACKGROUND: Randomised controlled trials (RCTs) inform prescription guidelines, but stringent eligibility criteria exclude individuals with vulnerable characteristics, which we define as comorbidities, concomitant medication use, and vulnerabilities due to age. Poor external validity can result in inadequate treatment decision information. Our first aim was to quantify the extent of exclusion of individuals with vulnerable characteristics from RCTs for all prescription drugs. Our second aim was to quantify the prevalence of individuals with vulnerable characteristics from population electronic health records who are actively prescribed such drugs. In tandem, these two aims will allow us to assess the representativeness between RCT and real-world populations and identify vulnerable populations potentially at risk of inadequate treatment decision information. When a vulnerable population is highly excluded from RCTs but has a high prevalence of individuals actively being prescribed the same medication, there is likely to be a gap in treatment decision information. Our third aim was to investigate the use of real-world evidence in contributing towards quantifying missing treatment risk or benefit through an observational study. METHODS: We extracted RCTs from ClinicalTrials.gov from its inception to April 28, 2021, and primary care records from the Clinical Practice Research Datalink Gold database from Jan 1, 1998, to Dec 31, 2020. We referred to the British National Formulary to classify prescription drugs into drug categories. We conducted descriptive analyses and quantified RCT exclusion and prevalence of individuals with vulnerable characteristics for comparison to identify populations without treatment decision information. Exclusion and prevalence were assessed separately for different age groups, individual clinical specialities, and for quantities of concomitant conditions by clinical specialities, where multimorbidity was defined as having two or more clinical specialties, and medications prescribed, where polypharmacy was defined as having five or more medications prescribed. Population trends of individuals with multimorbidity or polypharmacy were assessed separately by age group. We conducted an observational cohort study to validate the use of real-world evidence in contributing towards quantifying treatment risk or benefit for patients with dementia on anti-dementia drugs with and without a contraindicated clinical speciality. To do so, we identified the clinical specialities that anti-dementia drug RCTs highly excluded yet had corresponding high prevalence in the real-world population, forming the groups with highest risk of having scarce treatment decision information. Cox regression was used to assess if the risk of mortality outcomes differs between both groups. FINDINGS: 43 895 RCTs from ClinicalTrials.gov and 5 685 738 million individuals from primary care records were used. We considered 989 unique drugs and 286 conditions across 13 drug-category cohorts. For the descriptive analyses, the median RCT exclusion proportion across 13 drug categories was 81·5% (IQR 76·7-85·5) for adolescents (aged <18 years), 26·3% (IQR 21·0-29·5) for individuals older than 60 years, 40·5% (IQR 33·7-43·0) for individuals older than 70 years, and 52·9% (IQR 47·1-56·0) for individuals older than 80 years. Multimorbidity had a median exclusion proportion of 91·1% (IQR 88·9-91·8) and median prevalence of 41·0% (IQR 34·9-46·0). Concomitant medication use had a median exclusion proportion of 52·5% (IQR 50·0-53·7) and a median prevalence of 94·3% (IQR 84·3-97·2), and polypharmacy had a median prevalence of 47·7% (IQR 38·0-56·1). Population trends show increasing multimorbidity with age and consistently high polypharmacy across age groups. Populations with cardiovascular or otorhinolaryngological comorbidities had the highest risk of having scarce treatment decision information. For the observational study, populations with cardiovascular or psychiatric comorbidities had highest risk of having scarce treatment decision information. Patients with dementia with an anti-dementia prescription and contraindicated cardiovascular condition had a higher risk of mortality (hazard ratio [HR] 1·20 [95% CI 1·13-1·28 ; p<0·0001]) compared with patients with dementia without a contraindicated cardiovascular condition. Patients with dementia with comorbid delirium (HR 1·25 [95% CI 1·06-1·48]; p<0·0088), intellectual disability (HR 2·72 [95% CI 1·53-4·81]; p=0·0006), and schizophrenia and schizotypal delusional disorders (HR 1·36 [95% CI 1·02-1·82]; p=0·036) had a higher risk of mortality compared with patients with dementia without these conditions. INTERPRETATION: Overly stringent RCT exclusion criteria do not appropriately account for the heterogeneity of vulnerable characteristics observed in real-world populations. Treatment decision information is scarce for such individuals, which might affect health outcomes. We discuss the challenges facing the inclusivity of such individuals and highlight the strength of real-world evidence as an integrative solution in complementing RCTs and increasing the completeness of evidence-based medicine assessments in evaluating the effectiveness of treatment decisions. FUNDING: Wellcome Trust, National Institute for Health Research (NIHR) University College London Hospitals Biomedical Research Centre, NIHR Great Ormond Street Hospital Biomedical Research Centre, Academy of Medical Sciences, and the University College London Overseas Research Scholarship

    Phenotype Algorithms for the Identification and Characterization of Vaccine-Induced Thrombotic Thrombocytopenia in Real World Data: A Multinational Network Cohort Study

    Get PDF
    INTRODUCTION: Vaccine-induced thrombotic thrombocytopenia (VITT) has been identified as a rare but serious adverse event associated with coronavirus disease 2019 (COVID-19) vaccines. OBJECTIVES: In this study, we explored the pre-pandemic co-occurrence of thrombosis with thrombocytopenia (TWT) using 17 observational health data sources across the world. We applied multiple TWT definitions, estimated the background rate of TWT, characterized TWT patients, and explored the makeup of thrombosis types among TWT patients. METHODS: We conducted an international network retrospective cohort study using electronic health records and insurance claims data, estimating background rates of TWT amongst persons observed from 2017 to 2019. Following the principles of existing VITT clinical definitions, TWT was defined as patients with a diagnosis of embolic or thrombotic arterial or venous events and a diagnosis or measurement of thrombocytopenia within 7 days. Six TWT phenotypes were considered, which varied in the approach taken in defining thrombosis and thrombocytopenia in real world data. RESULTS: Overall TWT incidence rates ranged from 1.62 to 150.65 per 100,000 person-years. Substantial heterogeneity exists across data sources and by age, sex, and alternative TWT phenotypes. TWT patients were likely to be men of older age with various comorbidities. Among the thrombosis types, arterial thrombotic events were the most common. CONCLUSION: Our findings suggest that identifying VITT in observational data presents a substantial challenge, as implementing VITT case definitions based on the co-occurrence of TWT results in large and heterogeneous incidence rate and in a cohort of patints with baseline characteristics that are inconsistent with the VITT cases reported to date

    Phenotype Algorithms for the Identification and Characterization of Vaccine-Induced Thrombotic Thrombocytopenia in Real World Data:A Multinational Network Cohort Study

    Get PDF
    INTRODUCTION: Vaccine-induced thrombotic thrombocytopenia (VITT) has been identified as a rare but serious adverse event associated with coronavirus disease 2019 (COVID-19) vaccines. OBJECTIVES: In this study, we explored the pre-pandemic co-occurrence of thrombosis with thrombocytopenia (TWT) using 17 observational health data sources across the world. We applied multiple TWT definitions, estimated the background rate of TWT, characterized TWT patients, and explored the makeup of thrombosis types among TWT patients. METHODS: We conducted an international network retrospective cohort study using electronic health records and insurance claims data, estimating background rates of TWT amongst persons observed from 2017 to 2019. Following the principles of existing VITT clinical definitions, TWT was defined as patients with a diagnosis of embolic or thrombotic arterial or venous events and a diagnosis or measurement of thrombocytopenia within 7 days. Six TWT phenotypes were considered, which varied in the approach taken in defining thrombosis and thrombocytopenia in real world data. RESULTS: Overall TWT incidence rates ranged from 1.62 to 150.65 per 100,000 person-years. Substantial heterogeneity exists across data sources and by age, sex, and alternative TWT phenotypes. TWT patients were likely to be men of older age with various comorbidities. Among the thrombosis types, arterial thrombotic events were the most common. CONCLUSION: Our findings suggest that identifying VITT in observational data presents a substantial challenge, as implementing VITT case definitions based on the co-occurrence of TWT results in large and heterogeneous incidence rate and in a cohort of patints with baseline characteristics that are inconsistent with the VITT cases reported to date. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s40264-022-01187-y

    Transforming and evaluating electronic health record disease phenotyping algorithms using the OMOP common data model: a case study in heart failure

    Get PDF
    OBJECTIVE: The aim of the study was to transform a resource of linked electronic health records (EHR) to the OMOP common data model (CDM) and evaluate the process in terms of syntactic and semantic consistency and quality when implementing disease and risk factor phenotyping algorithms. MATERIALS AND METHODS: Using heart failure (HF) as an exemplar, we represented three national EHR sources (Clinical Practice Research Datalink, Hospital Episode Statistics Admitted Patient Care, Office for National Statistics) into the OMOP CDM 5.2. We compared the original and CDM HF patient population by calculating and presenting descriptive statistics of demographics, related comorbidities, and relevant clinical biomarkers. RESULTS: We identified a cohort of 502 536 patients with the incident and prevalent HF and converted 1 099 195 384 rows of data from 216 581 914 encounters across three EHR sources to the OMOP CDM. The largest percentage (65%) of unmapped events was related to medication prescriptions in primary care. The average coverage of source vocabularies was >98% with the exception of laboratory tests recorded in primary care. The raw and transformed data were similar in terms of demographics and comorbidities with the largest difference observed being 3.78% in the prevalence of chronic obstructive pulmonary disease (COPD). CONCLUSION: Our study demonstrated that the OMOP CDM can successfully be applied to convert EHR linked across multiple healthcare settings and represent phenotyping algorithms spanning multiple sources. Similar to previous research, challenges mapping primary care prescriptions and laboratory measurements still persist and require further work. The use of OMOP CDM in national UK EHR is a valuable research tool that can enable large-scale reproducible observational research
    corecore